Automatic Translation in Cross-Lingual Access to Legislative Databases
نویسندگان
چکیده
This paper considers the use of controlled languages for query translation in a legislative document retrieval system. Problem statement and analysis of the approach are described. The use of controlled languages is motivated by the fact that precision is very important in our case. In many information retrieval systems, the use of unrestricted language resources such as general purpose machine translation or bilingual lexica, provides better recall at the expense of precision. Ambiguities and polysemy make the search engine retrieve irrelevant documents as semantic knowledge is missing. Controlled languages help to better specify the word sense according to the domain of interest. Thus ambiguities are avoided and polysemy is specified according to the domain. We will implement our idea in the area of VAT regulation in Europe.
منابع مشابه
English-Persian Plagiarism Detection based on a Semantic Approach
Plagiarism which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them” poses a major challenge to knowledge spread publication. Plagiarism has been placed in four categories of direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...
متن کاملAutomatic Cross-Lingual Similarization of Dependency Grammars for Tree-based Machine Translation
Structural isomorphism between languages benefits the performance of cross-lingual applications. We propose an automatic algorithm for cross-lingual similarization of dependency grammars, which automatically learns grammars with high cross-lingual similarity. The algorithm similarizes the annotation styles of the dependency grammars for two languages in the level of classification decisions, an...
متن کاملExperiments in Cross Language Query Focused Multi-Document Summarization
The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual information robustly and efficiently, with as high quality performance as possible. Previous research activities on multilingual information access systems have studied cross-language information retrieval (CLIR), information ...
متن کاملAn Ontology-Based Architecture for Natural Language Access to Relational Databases
Natural language (NL) access to databases is a problem that has interested researchers for many years. We demonstrate that an ontology-based approach is technically feasible to handle some of the challenges facing NL query processing for database access. This paper presents the architecture, algorithms and results from the prototype thereof which indicate a domain and language independent archi...
متن کاملCross-lingual Lexical Similarity in Translation 1 Running head: Cross-lingual Lexical Similarity in Translation Semantic Access in Number Word Translation: the Role of Cross-Lingual Lexical Similarity
The revised hierarchical model of bilingualism (e.g., Kroll & Stewart, 1994) assumes that second language (L2) words primarily access semantics through their first language (L1) translation equivalents. Consequently, backward translation from L2 to L1 should not imply semantic access but occurs through lexical wordform associations. However, recent research with Dutch-French bilinguals showed t...
متن کامل